Adding Emotions to Malay Synthesized Speech Using Diphone-based Templates
نویسندگان
چکیده
This paper concerns the addition of an affective component to Fasih, one of the first Malay Textto-Speech systems developed by MIMOS Berhad. The goal is to introduce a new method of incorporating emotions to Fasih by building an emotions filter that is template-driven. The templates are diphone-based emotional templates that can portray four types of emotions, i.e. anger, sadness, happiness and fear. A preliminary experiment that focused on showed that the recognition rate of Malay synthesized speech is over 60% for anger and sadness.
منابع مشابه
Template-driven Emotions Generation in Malay Text-to-Speech: A Preliminary Experiment
This paper describes the pilot experiment conducted for the purpose of adding an affective component to the first Malay Text-to-Speech (TTS) system, Fasih. The aim is to test a new method of generating an expressive speech via a template-driven system based on diphones as the basic sound. The synthesized expressive speech can express four types of emotion. However, as an initial test the pilot ...
متن کاملIntegrating rule and template-based approaches for emotional Malay speech synthesis
The manipulation of prosody, including pitch, duration and intensity, is one of the leading approaches in synthesizing emotion. This paper reports work on the development of a Malay Emotional synthesizer capable of expressing four basic emotions, namely happiness, anger, sadness and fear for any form of text input with various intonation patterns using the prosody manipulation principle. The sy...
متن کاملComplex Emotions - the Simultaneous Simulation of Emotion-related States in Synthesized Speech
We describe an approach to simulate first and secondary emotional expression in synthesized speech simultaneously by targeting different parameter categories. The approach is based on the open-source system “Emofilt” which utilizes the diphone-synthesizer “Mbrola”. The evaluation of the approach by a perception experiment showed that the pure emotions were all recognized above chance. Whereas t...
متن کاملProsodic Analysis and Modelling for Malay Emotional Speech Synthesis
This paper discusses an emotional prosody generator for a Malay speech synthesis system that can re-synthesize the selected vocal emotion from neutral synthesized speech output and improve the naturalness by adopting rulebased prosody conversion techniques. The role of prosodic features in emotional expression, particularly fundamental frequency and duration, has been widely investigated in sev...
متن کاملمراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی
Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...
متن کامل